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ABSTRACT 


The  objective  of  this  thesis  research  is  to  develop  a  Back-Propagation  Neural  Network 
(BNN)  to  control  certain  classes  of  unknown  nonlinear  systems  and  explore  the  network's 
capabilities.  The  structure  of  the  Direct  Model  Reference  Adaptive  Controller  (DMRAC) 
for  Linear  Time  invariant  (LTT)  systems  with  unknown  parameters  is  first  analyzed  and  then 
is  extended  to  nonlinear  systems  by  using  BNN.  Nonminimum  phase  systems,  both  linear 
and  nonlinear,  have  also  been  considered. 

The  analysis  of  the  experiments  shows  that  the  BNN  DMRAC  gives  satisfactory 
results  for  the  representative  nonlinear  systems  considered,  while  the  conventional  least- 
squares  estimator  DMRAC  fails.  Based  on  the  analysis  and  experimental  findings,  some 
general  conditions  are  shown  to  be  required  to  ensure  that  this  technique  is  satisfactory. 
These  conditions  are  presented  and  discussed.  It  has  been  found  that  further  research  needs 
to  be  done  for  the  nonminimum  phase  case  in  order  to  guarantee  stability  and  tracking. 

Also,  to  establish  this  as  a  more  general  and  signiAcant  control  technique,  further 
research  is  required  to  develop  more  specific  rules  and  guidelines  for  the  BNN  design  and 
training. 
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I.  INTRODUCTION 


In  the  past  three  decades,  major  advances  have  been  made  in  adaptive  identification 
and  control  for  identifying  and  controlling  Linear  Time  Invariant  LTI  systems  with  unknown 
parameters.  The  choice  of  the  idendfier  and  controller  structures  is  based  on  well  established 
results  in  systems  theory.  The  adaptive  control  theory  is  then  improved  and  applied  to 
nonlinear  dynamic  plants  employing  neural  networks  with  the  right  choice  of  the  identifier 
and  controller  structures. 

The  objective  of  this  thesis  research  is  to  develop  a  Back-Propagation  Neural  Network 
(BNN)  to  control  certain  classes  of  unknown  nonlinear  dynamical  systems.  Initial  analysis 
is  directed  towards  a  Direct  Model  Reference  Adaptive  Controller  (DMRAC)  for  an 
unknown  LTI  system.  Results  of  the  simulation  are  displayed  for  this  system.  The  same 
analysis  is  then  performed  for  an  unknown  nonminimum  phase  system.  The  adaptive  control 
theory  is  then  applied  to  nonlinear  unknown  systems  by  employing  neural  networks. 
Basically,  four  nonlinear  models  have  been  analyzed  and  simulated.  Lastly,  the  adaptive 
control  algorithm  is  tried  with  nonlinear  unknown  nonminimum  phase  systems  again  by 
employing  neural  networks. 

In  what  follows,  Chapter  II  presents  the  analysis  of  a  DMRAC  for  an  unknown  LTI 
system  and  for  an  unknown  nonminimum  phase  LTI  system.  Chapter  III  concerns  neural 
networks  in  general  and  adaptive  control  of  nonlinear  plants  employing  neural  networks. 
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Chapter  IV  is  composed  of  the  simulations  that  have  been  carried  out.  The  results, 
conclusions  and  discussions  are  given  in  Chapter  V. 
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11 .  ADAPTIVE  CONTROL  OF  UNKNOWN  LTl  SYSTEMS 

A.  ADAPTIVE  CONTROL  IN  GENERAL 

The  search  for  design  techniques  to  control  systems  with  unknown  parameters  has 
drawn  much  attention  in  recent  years.  Adaptive  control  is  currently  one  of  the  most 
commonly  used  methods  in  the  control  of  systems  with  uncertain  dynamics.  Several 
applications  of  adaptive  control  such  as  ship  steering,  aircraft  control,  robot  manipulation, 
chemical  process  control  and  bio-medical  engineoing  have  been  performed  in  the  past  years 
[Ref.l].  The  general  structure  of  an  adaptive  control  system  is  shown  in  Figure  1. 

During  the  development  of  adiq)tive  control,  two  major  classes  have  emerged;  learning 
systems,  which  lead  to  the  introduction  of  learning  automatons  in  the  control  literature,  and; 
adaptive  systems  using  a  model  reference,  known  as  model  reference  adaptive  control 
systems  (DMRAC).  The  design  of  the  adaptation  algorithm  in  DMRAC  is  based  on  stability 
theory  since  the  stability  of  the  closed  loop  system  is  a  fundamental  requirement  in  the 
design  of  control  systems. 

In  adaptive  control  of  an  unknown  linear  time  invariant  (LTI)  system,  unknown 
parameters  are  estimated  by  an  on-line  estimator.  Based  on  the  estimated  parameters,  an 
adequate  design  can  be  achieved  to  implement  the  chosen  control  law.  This  process  is 
commonly  referred  to  as  indirect  adaptive  control.  Figure  2  shows  the  structure  of  an 
indirect  adaptive  control  system. 
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On  the  other  hand,  it  is  possible  to  parametrize  the  unknown  system  in  terms  of  the 
control  parameters  (e.g.  the  state-feedback  gains)  to  implement  the  chosen  control  law.  In 
this  case,  the  estimation  and  the  control  processes  are  carried  on  together.  This  alternative 
approach  is  called  direct  adaptive  control.  Figure  3  shows  the  structure  of  a  direct  adaptive 
control  system. 


CHOSIN  CONTROL  LAW 


Figure  1 .  General  Adaptive  Control  Structure 

The  main  idea  in  all  models  is  the  parameter  estimation.  When  estimating  the 
parameters,  the  uncertainties  must  be  expressed  linearly  in  terms  of  a  set  of  unknown 
parameters. 

In  linear  systems,  the  regression  can  be  adequately  used  to  obtain  the  state 
measurements  or  observations  from  the  systems,  with  the  unknown  parameters  as 
coefficients.  However,  in  nonlinear  systems,  nonlinear  functions  of  the  measurements  or 
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observations  are  generally  required.  With  unknown  nonlinear  systems,  these  nonlinear 
functions  cannot  be  specified  most  of  the  time.  So,  the  use  of  neural  networks  as  generic 
parametric  models  is  highly  recommended  in  such  cases. 


CHOSEN  CONTROL  LAW 


Figure  2 .  Indirect  Adaptive  Control  Algorithm 


CHOSEW 
CONTROL  LAW 


ESTIMATOR 
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■(t) 


UNKNOWN 

SYSTEM 
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Figure  3 .  Direct  Adaptive  Control  Algorithm 
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Before  developing  a  neural  network  based  direct  model  reference  adaptive  controller 
for  unknown  nonlinear  systems,  the  design  of  a  DMRAC  for  unknown  LTI  systems  will  be 
presented. 

B.  ANALYSIS  OF  A  DMRAC  FOR  UNKNOWN  LTI  SYSTEMS 

Consider  an  LTI  system 

A(q)y(t)  =  B(qm)>  (2-1) 

with  A(q)  and  B(q)  being  polynom  -U  operators'  with  unknown  coefficients.  For 
DMRAC  design,  it  is  assumed  that: 

1.  A(q)  is  monic  and  degree[Af^)]=n  is  known. 

2.  The  transfer  function  B(q)IA(q)  is  strictly  proper  and  the  relative  degree  is  known. 
The  goal  of  the  controller  is  to  move  the  closed  loop  system’s  dynamics  such  that 

D(q)y(t)=v(t),  (2-2) 

where  D(q)  is  the  arbitrary  monic  stable  characteristic  polynontial  operator  of  the  desired 
system,  and  v(t)  is  an  external  input.  The  general  structure  of  DMRAC  is  shown  in 
Figure  4. 

By  employing  a  steady-state  Kalman  filter,  observer- state-feedback  system  yields  the 
following  structure  for  the  feedback  controller. 


’The  argi  lent  q  of  the  polynomial  operators  is  the  forward  time-shift  operator  in  discrete 
time  modelling. 
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(2-3) 


u(t)  =  ^u(t)  ♦  *  m 

•(g)  •(g) 

where  a(g)  is  the  monic  characteristic  polynomial  operator  of  the  observer.  It  can  be 
chosen  arbitrarily,  provided  that  the  roots  are  in  the  stable  region.  The  polynomial 
operators  h(g)  and  are  the  feedback  polynomial  operators  of  the  unknown  system. 


Figure  4 .  General  Structure  of  the  DMRAC 
By  using  partial  state  representation,  equation  (2-1)  can  be  reconstructed  as 

A(g)z(t)  =  MfO, 
y(t)  =  B(g)z(t). 
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Combining  equations  (2-3)  and  (2-4)  yields  the  following: 

[u((0A(q>  -  h(q}A(q^  -  k(^B(q)i\z(t)  =  m(q)ii(t)y  (2-5) 

y(t)  =  mj(t) 

To  obtain  the  desired  closed  loop  behavior,  the  Diophantine  equation 

« {q)Aiq)  -  h(q)A{q)  -  kiq)B(q)  -  -^m(q)D(q)B(q) ,  (2-6) 

should  be  satisfied  so  that  the  closed  loop  system  poles  coincide  either  with  those  of  the 
reference  model  or  with  the  system's  zeros.  The  factor  I/b,  is  needed  to  ensure  that  the 
right  side  of  the  equation  is  monic  and  the  roots  of  B(q)  are  assumed  to  be  inside  the  unit 
circle  (i.e.  minimum  phase)  [Ref. 2]. 

In  equation  (2-6),  the  polynomial  operators  Afg)  and  B(q)  are  assumed  to  be  relatively 
co-prime  (i.e.  there  is  no  pole-zero  cancellation)  which  guarantees  a  unique  solution  for  h(q) 
and  k(q). 

Since  A(q)  and  B(q)  are  unknown  polynomial  operators,  an  estimator  is  required  to 
estimate  the  system  parameters  on-line,  based  on  the  generic  recwsive  regression  analysis. 
The  regression  equation, 

a (q)u(t)  =  h(q)u(t)  +  k(q)y(t)  *  —a (q) D(q)y(t) ,  (2-7) 
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is  obtained  by  using  partial  state  transformation. 

Using  9  as  the  forward  time-shift  operator,  the  filtered  input  and  output  signals  can  be 
defined  as 


g-'a(q)y‘'(t)  =  y(t), 
g-'‘a(g)u’'(t)  =  u(t). 


(2-8) 


Equation  (2-7)  can  be  expressed  in  a  more  convenient  form  as  follows: 


g-'u(t)  =  g-^'‘*'>h(g)u‘'(t)*g-<'*'>k(g)y‘'(t)*l^-'D(g)y(t), 


(2-9) 


where  n  is  the  order  of  the  unknown  system  and  r  is  the  number  of  .ae  closed  loop 
system  poles  which  must  be  placed  to  match  those  of  the  reference  model.  Equation  (2-9) 
can  be  represented  in  a  matrix  formation  as 

g-'um  = 


where 


*(t)  = 


u’'(t-r-J) 

u^(t-r-2) 

u’’(t-r-n*I) 

y^'a-r-i) 

y^(t-r-2) 

y’^(t-r~n*l) 

[g''D(g)y(t) 


*2 

\n-l) 

*2 

\n-i) 

b. 


(2-10) 
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By  using  the  linear  regressor  #('!>  and  recursive  estimation  of  as 


and. 


(2-11) 


(2-1 


the  unknown  parameters  of  the  system  are  estimated.  Hence,  the  control  equation  (2-3) 
can  be  rewritten  as 


v(t)  =  "* h(q)u ^(t)  * k(q)y  ^(t)  *—v(t). 


Equation  (2-13)  can  be  represented  as 

u(t)  = 


*j(t)  = 


u  ^(t-n*I) 
y''(t-2) 


y^'it-n*!) 


(2-13) 


(2-14) 
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It  can  be  noticed  that  all  the  equations  in  both  estimation  and  control  phases  are  the 
same  except  for  the  last  element  in  the  vector  •fOand  Hence,  this  identical  structure 
of  the  estimator  and  the  controller  prevents  the  unnecessary  intermediate  control  design 
calculations  and  speeds  up  the  control  process. 

Figure  S  illustrates  the  estimation  and  the  control  algorithm  of  the  DMRAC  where 
is  a  linear  associative  memory  with  recursive  estimation  to  minimize  the  mean  square 
errors  between  uf<>  and  u(t)  ~ 


Figure  5 .  Estimation  and  Control  Algorithm  of  the  DMRAC 


As  an  illustration.  Appendix  A  contains  a  numerical  example  of  the  design  of  a 
DMRAC  for  an  unknown  minimum  phase  LTI  system. 
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C.  ANALYSIS  OF  A  DMRAC  FOR  UNKNOWN  NONMINIMUM  PHASE  LTI 
SYSTEM 

Consider  the  LTI  system  in  section  B,  widt  polynomial  operators  A(q)  and  B(q)  which 
cause  the  system  to  be  nonminimum  phase  (i.e.  at  least  one  zero  outside  the  unit  circle  in  z- 
domain). 

Defining  the  polynomial  operators  and  B(q) 


Mq)  =  ♦ . ♦ 

m  =  q''*b,q<'‘-»* . 


(2-15) 


and  using  the  same  structure  for  the  feedbiu:k  controller  in  equation  (1-3),  the  following 
Diophantine  equation 

h(q)A(q)  *  k(q)B(q)  =  m(q)[A(q)  -  P*(q)J,  (2-16) 

is  obtained.  a(q)  is  the  monic  characteristic  polynomial  operator  of  the  observer.  It  is 
also  arbitrary  and  stable.  P*(q)  is  the  stable  arbitrary  characteristic  polynomial  of  the 
desired  closed  loop  system. 

In  nonminimum  phase  systems,  for  estimating  h(q)  and  k(q)y  the  filtered  partial  state 
2^(0  must  be  replaced  by  an  estimate  in  terms  of  the  available  signals  (t)  and  (t).  This 
can  be  done  using  the  Bezout  identity  as  follows: 


b(q)B(q)*c(q)A(q)  =  /, 


(2-17) 
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which  holds  provided  A(q)  and  B(q)  are  mutually  co-prime  polynomials.  Hence,  defining 
b(q)  and  c(q)  as 


♦  ••  ♦  K » 

c(q)  =  q’^*c,q<‘‘"‘^  *e^, 

Equation  (2-10)  can  be  rewritten  as 

•(q)u(t)  =  •'‘roe.. 


u''(t-r-2) 

I 

I 

u  ^(t-r-n*I) 
y^(t-r-2) 

I 

y^(t-r-n*I) 

♦ro  =  ^ 

u^(t-r-2) 

(t-r-n*I) 

f(t-r-I) 

y^(t-r-2) 


(2-18) 


(2-19) 
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where 


u^(t)  =  P*(q)m(q)u’'(t), 

/r/)  =  P*(q)*(q)y^(t). 

So,  the  algorithm  in  order  to  obtain  a  DMRAC  for  unknown  LTI  system  is  developed 
by  using  an  adaptive  pole  placement  algorithm  for  unknown  nonminimum  phase  LTI 
systems. 

Using  equation  (2-17),  the  state  z(t)  can  be  estimated  as 

zfl)  =  e(9)  u(t)  ♦  b(9)y(t)  ,  (2-20) 

which  implies  that  b(q)  and  c(q)  arc  the  parameters  of  the  observer. 

Appendix  B  contains  a  numerical  example  of  the  design  of  a  DMRAC  for  an  unknown 
nonminimum  phase  LTI  system. 
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III.  NEURAL  NETWORKS  IN  ADAPTIVE  CONTROL  OF 


UNKNOWN  NONLINEAR  SYSTEMS 

A.  NEURAL  NETWORKS  IN  GENERAL 

Neural  networks  are  composed  of  many  simple  elements  operating  in  parallel.  These 
elements  are  inspired  by  biological  nervous  systems.  The  network  function  is  determined 
largely  by  the  connections  between  the  elements  [Ref.3]. 

Neural  networks  have  been  trained  to  perform  complex  functions  in  various  fields  of 
applications  including  pattern  recognition,  identification,  classification,  speech,  vision  and 
control  systems.  Neural  networks  have  been  studied  for  many  years  in  the  hope  of  achieving 
human-Uke  performance  in  the  fields  of  ^leech  and  image  recognition  [Rcf.4].Today  neural 
networks  can  be  trained  to  solve  problems  that  are  unsuitable  to  conventional  computers 
IRef.3]. 

A  neural  network  is  usually  a  layered  network  consisting  of  an  input  layer,  an  output 
layer  and  at  least  one  layer  of  nonlinear  processing  elements.  The  nonlinear  processing 
elements,  which  sum  incoming  signals  and  generate  output  signals  according  to  some 
predefined  function  are  called  neurons.  The  neurons  are  connected  by  terms  with  variable 
weights.  The  output  of  one  neuron  multiplied  by  a  weight  becomes  the  input  of  an  adjacent 
neuron  of  the  next  layer. 


A  single  neuron  with  it  inputs  is  shown  in  Figure  6.  The  individual  input  j^), 
weighted  by  the  elenient  MJJ)  of  the  matrix  w,  are  summed  to  form  the  weighted  inputs  to 


the  transfer  function  F.  The  neuron  has  a  bias  b  and  an  output  y  given  by 


J'  =  r  [52  *(j)  *  *1  • 


(3-1) 


As  in  equation  (3-1),  the  transfer  function  net  input  is  the  sum  of  the  weighted  inputs 
and  the  bias  b.  This  sum  is  the  argument  of  the  transfer  function.  The  weight  vector  w,  and 
the  input  vector  x  can  be  represented  as 


w  =  [  —  Miin)  1 ,  X 


'x(l) 

x(2) 


[xfrt>J 


(3-2) 


Figure  6.  A  Single  Neuron  Model 
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Two  or  more  of  these  neurons  may  be  combined  in  a  layer  and  a  particular  network 
might  contain  one  or  more  such  layers.  First  consider  a  single  layer  of  neurons  where  each 
element  of  the  input  vector  x  is  connected  to  each  neuron  input  through  a  weight  matrix  w. 
In  this  case,  there  are  m  neurons  and  each  of  them  has  a  summer  and  the  summer  outputs 
form  an  m  element  vector  z.  The  transfer  function  net  input  is  the  sum  of  its  appropriately 
weighted  inputs  and  bias  b.  At  the  end,  all  neuron  outputs  form  an  m  element  output  vector 
y.  A  one  layer  neural  network  with  n  inputs  and  m  neurons  is  shown  in  Figure  7.  So, 
equation  (3-1)  can  be  represented  for  this  case  as 


y  =  r 


!•/  J^l 


(3-3) 


All  weights  can  be  represented  in  matrix  format  as 


MU) 

MU)  ... 

..  Mi/t) 

M2J) 

M2;2)  ... 

..  M2/t) 

••• 

MtU)  ... 

..  Mats), 

where  each  row  represents  the  weights  of  one  layer. 

A  network  can  have  several  layers  where  each  layer  has  a  weight  matrix  w,  a  bias 
vector  b,  a  weighted  input  z  to  the  transfer  function,  and  an  output  vector  y.  Layers  whose 
outputs  are  the  network's  outputs  are  called  output  layers.  All  other  layers  are  called  hidden 
layers.  Commonly  an  input  vector  is  presented  to  a  network,  the  outputs  are  calculated  and 
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an  algorithm  is  applied  to  determine  the  weight  element  changes.  However,  one  may  want 
to  apply  more  than  one  input  vector  simultaneously  and  get  the  network's  response  to  each 
one  of  them.  This  operation  is  called  batching  and  this  will  not  be  part  of  this  thesis. 


Figure  7  .  A  one  Layer  Neural  Network 


Initialization  of  the  weights  and  bias  elements  to  small  positive  and  negative  values 
provides  enough  variation  in  the  weights  and  biases  so  that  neurons  in  the  network  start  out 
with  a  range  of  behaviors  that  can  be  taken  advantage  of  by  the  learning  rule. 

There  are  several  learning  rules  such  as  Hebbian,  Instar,  Kohonen,  Outstar  learning 
rules  [Ref.3].  Among  the  most  common  learning  rules  is  backpropagation  which  adjusts  the 
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weights  and  biases  of  the  network  in  order  to  minimize  the  mean  squared  error  criterion. 
This  is  a  gradient  algorithm  and  it  is  done  by  continually  adjusting  the  values  of  the  weights 
and  biases  in  the  direction  of  the  gradient  of  an  appropriate  cost  function. 

The  most  distinctive  and  appealing  feature  of  many  neural  networks  is  that  they  learn 
by  examples. 

Currently,  the  most  popular  and  commonly  used  neural  networks  for  control  system 
design  is  the  Back-Propagation  Neural  Networks  (BNN)  which  is  discussed  in  the  following 
section. 

B.  ANALYSIS  OF  BACK-PROPAGATION  NEURAL  NETWORKS 

A  Back-Propagation  Neural  Network  is  a  multilayer,  feed-forward  network  which  has 
an  input  layer,  an  output  layer  and  at  least  one  hidden  layer.  Neurons  are  found  in  the  output 
and  hidden  layer(s),  while  the  input  layer  has  only  input  connections  feeding  the  neurons  in 
the  first  hidden  layer.  There  exists  no  feedback  or  even  interconnection  between  neurons  in 
the  same  layer.  There  is  generally  a  bias  input  for  each  neuron  with  an  associated  non-zero 
weight  A  three  layer  network  is  shown  in  Figure  8.  It  has  one  output  layer  and  two  hidden 
layers. 

In  Figure  8,  wi(jji)  is  defined  as  the  connection  weight  for  the  path  from  the/*  neuron 
in  the  layer  to  the  it*  neuron  in  the  ^*  layer.  It  is  assumed  that  the  weight  vector  wi(j,k) 
is  a  constant  so  that  a  partial  derivative  can  be  rigorously  defined  [Ref.5].  Then  the  BNN  in 
Figure  8  can  be  represented  mathematically  as 
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y3  =  Tl(w3*T2(w2*T3(wl*x  *  bl)*  b2)*  M), 


(3-5) 


where  x(j)  is  the  input  vector  to  the  BNN. 

In  the  learning  process,  the  BNN  adjusts  weights  wi  for  all  i,  to  minimize  a  suitable 
function  of  the  error  between  the  output  y  and  a  desired  output  y^  where  y/t)  is  the  signal  to 
be  approximated  given  the  input  xfl),  f 


3l{l) 


bl(ml) 


Figure  8.  A  Multilayer  Neural  Network 
The  most  common  error  function  used  is 

2  f.i 


(3-6) 
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where  is  the  signal  to  be  approximated  given  the  input  x(t},  t  *  and  y(t\B}  the 

output  of  the  neural  network  with  parameter  • .  A  general  picture  of  this  mechanism  is 
given  in  Figure  9. 

The  BNN  implements  a  modification  of  die  gradient  descent  algorithm  to  update  each 
weight  at  time  t+J  as  follows: 

,  (3.7) 


where  /i  >  0  is  the  learning  rate  and  6  is  the  weight  vector  and  e(t)  =  y(t\B)  ~  y/t) . 


Figure  9.  Obtaining  the  Error  in  BNN 


A  general  flow  diagram  for  a  BNN  is  given  in  Figure  10.  The  algorithm  that  is  used 
to  simulate  the  BNN  is  presented  below.  The  MATLAB  codes  to  accomplish  this  task  are 
in  Appendix  C.  For  simuladon  purposes,  a  two  layer  neural  network  is  chosen  which  is  quite 


effective  ctmipared  with  thiee  or  nxxe  layered  networks.  A  sigmoid  is  used  as  the  nonlinear 
transfer  function. 


hi 


Figure  10.  A  General  Flow  Diagram  for  BNN 


In  order  to  derive  the  back-propagation  algorithm,  the  weights,  biases,  input  and  ou^ut  are 
defined  as 


3^  =  ^]. 


(3-8) 
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The  input  to  the  first  layer  is  given  as 


v‘  -  wl*u  ♦  bl 


V 


/ 


«ii 

< 

u* 

^12 

(3-9) 


Hence,  the  output  of  the  first  layer,  v  can  be  represented  as 


v=s^moid  (v0  = 


Likewise,  the  input  to  the  second  layer  is  given  as 

z'  =  w2*v*  b2 


z'  = 


|C,f  Cj2 


ICji  C22 


'21 


b. 
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CijV,*C,2V2*b22 


Vl  *  ^2^2  *  ^22 


And  the  output  of  the  second  layer  can  be  given  as 


z  =  sigmoid  (z')  = 


So,  the  output  of  the  network  can  be  written  as 


y=w3  *  z  =  [fjj  fj2  ] 


=///**/  *ft2*h 


(3-10) 


(3-11) 


(3-12) 


(3-13) 
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The  error  can  be  calculated  as 


(3-14) 


Back-Propagating  the  error  through  the  network,  the  followings  are  obtained: 


d2  -  ^Uag  (w3**d3)*ds^  (z) 


dl  =  diag  (w2'*d2)*dsig  (v0> 


(3-15) 


Partial  differentiations  of  an  appropriate  cost  function  with  respect  to  all  weight  vectors 
multiplied  by  a  proper  learning  rate  are  subtracted  from  the  weight  vectors  of  the 
previous  iteration  to  obtain  the  new  weight  vectors  as  follows: 

w3=w3-ft  *  •^^-w3-/i(d3*z') 

Bw3 

w2=w2-fi  *  '^^sw2-/i(d2*v')  (3-16) 

dw2 

wl=wl-fi  *  =  wl  -  ft(dl  *  u')  . 

Bwl 


In  a  similar  way,  we  can  show  that  for  the  bias  terms. 


b2^b2-M*—=  b2-M*d2  , 
db2 


bt^bl-ft*-^^  M-ffdl 

dbl 


(3-17) 
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IV.  APPLICATIONS  AND  SIMULATIONS 

A.  EXPERIMENTING  WITH  THE  BNN  DMRAC  FOR  UNKNOWN  MINIMUM 

PHASE  SYSTEMS 

In  this  chapter,  the  results  of  the  experiments  using  the  BNN  DMRAC  on  various 
nonlinear  SISO  systems,  are  presented.  Four  experiments  are  conducted  using  software 
simulations  for  the  four  classes  of  unknown  nonlinear  systems  considered  in  Chapter  III. 
The  main  purpose  of  these  experiments  is  to  see  under  what  conditions  the  proposed  BNN 
DMRAC  works.  The  software  simulation  programs  used  in  these  experiments  are  listed  in 
Appendix  C. 

Lastly,  an  experiment  for  an  unknown  nonlinear  nonminimum  phase  system  is 
conducted  based  on  the  system  Model  2. 

As  stated  in  [Ref.l],  four  important  classes  of  unknown  nonlinear  SISO  systems  are 
considered  for  direct  adaptive  control  using  tlw  BNN.  They  are  modeled  in  discrete-time  for 
analysis  and  simulation.  These  are  the  system  models  used  in  [Ref.6]  for  which  BNN 
indirect  adaptive  control  has  been  successfully  demonstrated.  The  four  models  are  the 
following  with  F,  a  continuous  smooth  function: 

n )  Model  1: 

M-  1 

fl*  y(t-  k)*T  (4-l) 

M 
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Large  mechanical  systems,  hard  nonlinearides  such  as  input  saturation,  dead  zones  or 
backlash  are  readily  described  by  this  model  [Ref.l]. 

(2)idQSlsL2: 


y(t*l)  *  r  +53**  "(t-  k)  (4.2) 

M 

The  action  of  viscous  drag  on  an  underwater  vehicle  can  be  modelled  by  this  model  [Ref.  1  ]. 
(31  Model  3  : 

y(t*l)  =  r  {y(t)j(t‘  n *1)]  *  r  lu(t)Mt-  />]  (4-3) 

Underwater  vehicles  subjected  to  input  saturation  and  viscous  drag  can  be  formulated  by 
this  model  [Ref.l]. 

(4)  Model  4: 

y(t* /j  =  r  m*  />]  (4.4) 

Bilinear  systems  are  part  of  this  model  [Ref.l]. 

1.  Experiment  1:  System  Model  1 

In  the  first  experiment,  a  nonlinear  system  described  by  Model  1  was  controlled 
by  a  BNN  DMRAC.  The  chosen  nonlinear  system  is  given  in  equation  (4-1)  as 

=  03y(t)  *  0.6y(t-I)  *  u(tf  *  OJ  u(tf  -  0.4 u(t)  .  (4-5) 
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According  to  the  suggested  training  fu-ocedures,  the  BNN  was  first  trained  off-line. 
The  training  set  made  of  a  randomly  generated  training  input  y(t)  and  the  resulting  ouqMit 
y(t)  of  the  neural  network  and  also  the  extmal  input  v(t)  and  the  desired  response  of  the 
system,  yjt)  are  shown  in  Figure  1 1 .  The  Training  set  consists  of  10  000  data  points  each 
for  the  input  and  the  output  measurements.  The  external  input  to  the  system,  v(t)  is  a  sum 
of  sinusoids  with  different  magnitudes  and  frequencies.  The  BNN  was  next  placed  on-line 
to  control  tlw  system.  During  the  on-line  control  i^iase,  the  BNN  recursively  learns  to  adapt 
to  the  required  control  structure.  The  ouqrut  of  the  controlled  system  compared  to  that  of  the 
reference  model  is  displayed  in  Figure  12.  The  reference  model  used  in  the  experiment  is 
chosen  as 

yJt)^0MyJt-l>*v(t-l)  ,  (4-6) 

where  v(t)  is  the  external  input  to  the  system. 

2.  Experiment  2:  System  Model  2 

In  the  second  experiment,  a  nonlinear  system  described  by  Model  2  was  used. 
It  is  governed  by 


.  (4-7) 

I  *  y(tf  *  y(t~  If 

The  input  to  the  BNN  estimator  will  be  the  regressor  vector  tPfO  of  equation  (2-10). 

in  equation  (2-14)  is  the  input  vector  during  the  control  phase.  The  same  reference 
model  and  the  same  procedures  for  off-line  training  and  on-line  control-plus-leaming 
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were  employed.  The  BNN  estimator  was  first  trained  off-line  with  a  10  000  point  training 
set  The  training  input  u(t)  and  the  resulting  output  y(r>  of  the  neural  iMtwork  and  also  the 
external  input  vfl)  and  the  desired  response,  yjt)  of  the  system  are  shown  in  Figure  13. 


Pigur*  11.  The  Training  Input  the  Resulting  Ou^mt  the  External  Input  and  the  Desired 
Response  for  the  System  Model  1 
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Figur*  12.  The  Ouq>ut  of  the  Controlled  System  Compared  To  That  of  the  Reference 
Model  for  System  Model  1 
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Pigur*  13.  The  Training  Input,  the  Resulting  Output,  the  External  Input  and  the  Desired 
Response  for  the  System  Model  2 
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Finally,  the  BNN  was  placed  on-line  to  control,  and  learn  the  control  structure  and 
estimate  the  parameters  simultaneously.  As  shown  in  Figure  14,  the  system  with  the  BNN 
DMRAC  successfully  tracks  the  model  reference  system  closely.  To  optimize  the 
performance  of  the  control  system,  many  different  learning  parameters  and  training  data 
were  tried.  However,  once  a  trained  BNN  works,  it  tracks  the  reference  model  reasonably 
well  for  the  inputs  with  similar  characteristics. 

3.  Experiment  3:  System  Model  3 

In  this  experiment,  a  nonlinear  system  described  by  Model  3  is  chosen  as 

yft  ♦  7)  =  +  u(tf  .  (4-8) 

The  same  procedures,  used  for  the  previous  experiments  were  conducted.  The  training 
input  u(t)  and  the  resulting  output  y(t)  of  the  neural  network,  and  also  the  actual  input 
v(t)  and  the  desired  response  yjt)  of  the  system  are  shown  in  Figure  15.  The  comparison 
between  the  actual  system  output  and  the  reference  model  output  are  displayed  in  Figure 
16. 

4.  Experiment  4:  System  Model  4 

In  the  final  experiment,  the  nonlinear  system  is  governed  by: 

y(t+I)  =  t)y(t-2)  u(t- 1)  \y(t-2)*  I]*  u(t) 

1  yit  1)2  y{t  2)2 
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Other  than  the  order  of  the  system,  there  is  not  much  change  in  the  input  vectors.  Once 
again,  the  same  procedures  were  applied  to  this  nonlinear  model.  The  training  input  i  (t)  and 


Figure  14.  The  Output  of  the  Controlled  System  Compared  To  That  of  the  Reference 
Model  for  System  Model  2 
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the  resulting  output  of  the  neural  network,  and  also  the  actual  input  v(t)  and  the  desired 
response  are  displayed  in  Figure  17.  The  output  of  the  controlled  system  compared  to 
that  of  the  reference  model  is  displayed  in  Figure  18. 


Plgurs  15. The  Training  Input,  the  Resulting  Output,  the  External  Input  and  the 
Desired  Response  for  the  System  Model  3 


(S  O  'O 
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Flgur*  16 .  The  Output  of  the  Controlled  System  Compared  To  That  of  the  Reference 
Model  for  System  Model  3 
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Figure  17.  The  Training  Input,  the  Resulting  CXitput,  die  External  Input  and  the  Desired 
Response  for  the  System  Model  4 
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Pigura  18 .  The  Output  of  the  Controlled  System  Compared  To  That  of  the  Reference 
Model  for  System  Model  4 


37 


B.  EXPERIMENTING  WITH  THE  BNN  DMRAC  FOR  UNKNOWN 

NONMINIMUM  PHASE  SYSTEMS 

In  this  section,  the  BNN  DMRAC  is  extended  to  nonminimum  phase  systems.  For 
experiment  purposes.  Model  2  is  considered. 

The  pole  placement  algorithm  [Ref.?]  which  was  used  for  LTI  nonminimum  phase 
systems  was  tested  on  nonlinear  nonminimum  phase  systems.  But  the  experiment  showed 
that  nonlinearities  affect  the  results  and  cause  the  system  to  go  unstable  especially  in  the 
nonminimum  phase  case. 

So,  it  was  decided  not  to  consider  the  pole  placement  approach  for  this  case  and  the 
BNN  DMRAC  is  direcdy  applied  to  a  nonlinear  nonminimum  phase  system.  The  simulations 
yield  fairly  reasonable  results  which  are  displayed  in  Figure  19  and  in  Figure  20. 

The  unknown  nonlinear  nonminimum  phase  system  used  is  chosen  as 


y(t*i) 


(4-6) 


From  the  following  plots  we  see  that  in  some  cases  the  output  of  the  system  is  able  to 
track  the  desired  reference  signal.  However,  in  several  runs,  the  nonminimum  phase  system 
does  not  perform  as  desired  and  it  might  become  unstable.  The  extension  of  a  neural  network 
controller  to  this  class  of  systems  is  still  an  open  problem  in  the  control  system  literature. 
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Figure  19 .  The  Training  Input,  the  Resulting  Ouq)ut,  the  External  Input  and  the 
Desired  Response  for  the  Nonlinear  Nonminimum  Phase  Unknown  System 
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Figur*  20 .  The  OutfNit  of  the  Controlled  System  Compared  To  That  of  the  Reference 
Model  for  the  Nonlinear  Nonminimum  I%ase  System 
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C.  IMPLEMENTATION  OF  THE  BNN  SOFTWARE  SIMULATOR 

For  this  thesis  research,  a  software  BNN  simulator  was  developed.  Until  neural 
network  hardware  systems  or  neurocomputers  become  commonly  (and  economically) 
available,  most  researchers  will  work  with  software  simulators  for  neural  networks.  The 
software  approach  offers  the  full  flexibity  for  development,  allowing  the  user  to  exercise  and 
experiment  freely  with  the  various  features  of  the  neural  network  [Ref.lJ.  The  main 
drawback  of  this  approach  is  the  slow  learning  process  of  the  simulator. 

By  using  the  appropriate  MATLAB^  codes,  the  following  functions  have  been 
developed  and  used  in  simulations: 

*  INITIAL  2  :  This  function  initializes  the  BNN.  It  takes  as  input  variables  the 
parameters  describing  the  number  of  inputs  and  number  of  outputs,  number  of  neurons  in 
the  hidden  layer(s).  It  yields  as  ouqtuts  the  initial  conditions  of  the  weights  and  biases  of  the 
network. 

*  NET  2:  This  function  sets  up  the  data  structure  for  a  2  layer  BNN.  It  takes  as  input 
variables  a  parameter  describing  the  number  of  inputs  and  neurons  in  the  hidden  and  output 
layer.  It  also  takes  as  another  input  variable  a  parameter  specifying  the  spreading  range  of 
the  biased  inputs. 

*  BP  2:  This  function  backpropagates  the  output  errors  to  the  input  layer.  It  takes  as 
input  variables  the  parameter  describing  the  weights  and  biases,  and  the  random  input, 
learning  rate  and  the  actual  input.  It  yields  updated  weights  and  biases. 

*  SIGMOID :  This  function  introduces  the  nonlinearity  to  the  neural  network. 

*  DSIG  :  This  function  takes  the  derivative  of  the  sigmoid  function. 


MATLAB  is  a  registered  trademark  of  the  Math  Works,  Inc. 
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These  functions  can  be  all  implemented  recursively,  and  imbedded  in  any  iterative 
loop.  The  source  codes  of  these  functions  used  in  this  thesis  research  are  provided  in 
Appendix  C. 
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V.  OBSERVATIONS,  DISCUSSIONS  AND  CONCLUSIONS 


A.  SUMMARY 

A  Direct  Model  Reference  Adaptive  Controller  (DMRAC)  for  LTI  systems  was 
developed  and  applied  to  both  minimum  phase  and  nonminimum  phase  LTI  systems  for  pole 
placement.  The  DMRAC  for  LTI  systems  was  then  extended  to  nonlinear  systems  by 
training  a  BNN  to  emulate  a  suitable  nonlinear  regression  form  that  describes  the  system 
under  consideration. 

Later,  the  control  of  four  general  classes  of  unknown  nonlinear  systems,  modelled  in 
discrete-time,  using  the  BNN  DMRAC  was  considered.  Lastly,  an  experiment  for  unknown 
nonlinear  nonminimum  phase  systems  was  conducted. 

B.  OBSERVATIONS,  DISCUSSIONS  AND  CONCLUSIONS 

A  DMRAC  designed  with  a  least-squares  estimator,  assuming  the  system  is  LTI,  failed 
to  work  for  most  nonlinear  systems.  Hence,  the  BNN  DMRAC  is  an  effective  technique  in 
controlling  nonlinear  systems  where  the  conventional  technique  fails. 

Some  experiments  conducted  showed  that  the  BNN  DMRAC  performs  its  control 
function  reasonably  well  for  various  types  of  inputs.  However,  for  inputs  with  high 
frequency  components  (with  respect  to  the  sampling  rate),  the  controlled  system  became 
quite  oscillatory,  so  that  it  would  not  be  able  to  track  the  reference  model  properly.  In 
addition,  the  controller  would  sometimes  saturate  during  training  and  therefore  it  failed  to 
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control  the  system  [Ref.  1].  It  is  suggested  that  the  solution  to  this  problem  is  to  increase  the 
sampling  rate  with  or  without  increasing  the  number  of  neurons  and  hidden  layers  used  in 
the  BNN.  Off-line  training  with  more  appropriate  data  (i.e.  training  signals  containing 
similar  b’equency  characteristics  as  the  actual  signals  experienced  by  the  controlled  system), 
and  adjusting  the  learning  parameters  also  helps  to  improve  the  tracking  performance  and 
avoid  saturation  [Ref.l].  Unfortunately,  there  is  still  no  general  rule  to  help  select  the  most 
appropriate  learning  parameters.  Hence,  a  great  deal  of  experimentation  is  usually  required. 

The  BNN  DMRAC  designed  for  unknown  nonminimum  phase  systems  is  not 
promising  compared  with  their  minimum  phase  counterpart.  A  nonminimum  phase  system 
is  usually  hard  to  stabilize  adaptively  by  a  direct  approach. 

C.  FURTHER  RESEARCH  AND  DEVELOPMENT 

In  this  thesis  research,  the  emphasis  was  to  develop  a  structure  for  direct  adaptive 
control  of  certain  classes  of  unknown  nonlinear  systems  using  the  BNN.  The  results  of  the 
experiments  clearly  showed  that  the  BNN  DMRAC  is  very  effective  in  controlling  unknown 
nonlinear  systems.  On  the  other  hand,  nonminimum  phase  systems  at  the  moment  do  not 
yield  satisfactory  results  and  need  more  research  efforts. 

Once  more,  in  the  design  of  the  BNN,  the  selection  of  the  number  of  layers,  neurons, 
the  type  of  nonlinear  transformation,  etc.,  is  still  at  the  discretion  of  the  designer.  This  area 
definitely  needs  further  research. 
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APPENDIX  A 


DMRAC  DESIGN  FOR  UNKNOWN  MINIMUM  PHASE  LTI 

SYSTEMS 

In  this  Appendix  we  address  the  problem  of  designing  an  adaptive  controller  for  a  LTI 
system  based  on  the  algebraic  approach.  The  unknown  LTI  system  to  be  controlled  is 
described  by  the  following  polynomial  operators: 

Mg)  -  OJg  *  0.9, 
m  =  3g. 

In  order  to  track  the  reference  model 

ym(0-0.8yJM)  =  v(M), 


(A-1) 


(A-2) 


the  desired  closed  loop  polynomial,  P*(g)  and  the  characteristic  polynomial  of  the 
observer,  a(g)  are  chosen  arbitrarily  and  with  roots  in  the  stable  region  as 


Pyg)  =  {3g(g-0.8)  =  g(g-0.8), 
3 

•(g)  =  g^  -  g*  025. 


(A-3) 


Using  the  algorithm  in  Chapter  1  Section  B,  the  Diophantine  equation  can  be  formed 


as  follows; 

(h^g  +  hj)(q^  -  0.2g  +  0.9)  *(kjg*  kj)3g  =  (g^  -  g  +  0.25)  [g  -  OJg  +  0.9  -  g(g  -  0.8)] 

(A-4) 
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or 


(h,q  *  -  0.2q  *  0.9)  ♦  (k^q  *  kj)3q  =  0.6q^  ♦  0.3q^  -  0.75q  *  022S.  (A-5) 


The  MATLAB  simulation  for  the  above  system  yields  the  following  steady-state  gains: 


h,  =  0.61234, 
hj  =  0.2493S, 
k,  =  0.0S678, 
k^  =  -0.42982. 


These  results  can  be  checked  by  using  the  Sylvester  matrix  formulations  for  this 
system  which  in  this  case  is  given  by 


1  0  0  0 

*1 

0.6 

-0.2  1  3  0 

0.3 

0.9  -0.2  0  3 

-0.75 

0  0.9  0  0 

0.225 

By  solving  equation  (A-6)  we  obtain  the  following  values: 

=  0.6, 
hj  =  0.25, 
kj  =  0.0567, 
kj  =  -0.4133. 


Hence,  the  estimated  gains  needed  to  control  the  system  are  found  to  be  very  close  to 
the  steady-state  gains  obtained  from  the  Sylvester  matrix  equations. 
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The  simulation  results  of  the  experiment  for  the  system  above  are  displayed  in  Figures 


(A- 1)  thru  (A-3). 


o’  o’  d 

aoniiiMovw  aaoLiNOviN 


Figur*  A-1 .  Reference  Input  and  Estimated  Control  Parameters  for  LH  Minimum 
Phase  Unknown  System 


TIME  (Sec.) 


DESIRED  OUTPUT  ym(t)  {->  vs  ACTUAL  OUTPUT  y(t)  { 
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Flgur*  A-3  .  Desired  Output  vs  Actual  Output  For  LTI  Minimuin  Phase  Unknown 
System 
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% 

A  DMRAC  DESIGN  FOR  UNKNOWN  MINIMUM  PHASE  LTI  SYSTEMS 

% 

Designing  a  controller  using  recursive  least  squares  estimates 
^  of  the  compensator  parameters. 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

!  del  *.met 
clear.clg.clc 

Generating  the  reference  input  signal 
t=():l()0; 

v=square(t/(0.5*pi)): 
kmax=length(  V )+ 1 ; 
for  i=l:kmax-l 
if  v(i)<=() 
v(i)=0; 
end 
end 

%  Initialization  of  the  parameters 
u=zeros(l,kmax); 
ubar=zeros(l,kmax); 
y=zeros(l,kmax); 
ybar=zeros(  1  ,kmax); 
ym=zeros(  1  ,kmax); 
th=zeros(5,kmax); 
th(5,4)=0.1; 

P=l()()()()*eye(5); 

Generating  the  reference  model 
for  j=2:kmax- 1 
ym(j)=().8*ym(j- 1  )+v(j- 1 ); 
end 

*'/<  Designing  the  controller 
for  k=4;kmax- 1 
hl=th(l,k); 
h2=th(2,k); 
kl=th(3,k); 
k2=th(4,k); 
ibl=th(5,k); 
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u(k)=(h  1 + 1  )*u(k- 1  )+(h2-0.25)*u(k-2)+kl  *y{k- 1  )+k2*y(k-2)+... 

(v(k)-v(k- 1  H0.25*v(k-2))*ib  1 ; 
ubar(k)=u(k- 1  )-u(k-2)+0.25*u(k-3); 
y(k)=0.2*y(k- 1  )-0.9*y(k-2)+3*u(k- 1 ); 
ybar(k)=y(k)- 1 .8*y(k- 1  )+l  .05*y(k-2)-0.2*y(k-3); 
phit(k,:)=[u(k-2)  u(k-3)  y(k-2)  y(k-3)  ybar(k)); 
K(:,k)=(P*phit(k.:)')/(l+phit(k,:)*P*phit(k.:)’); 
P=P-(P*phit(k.:)'*phit(k,:)*P)/(l+phit(k.:)*P*phit(k,;)'); 
th(:,k+l)=lh(:,k)+K(:,k)*(ubar(k)-phil(k,:)*th(:J[;)); 
end 

'/{  Plotting  the  results 
subplot(21 1) 
axis([()  100-0.1  1.2]); 
plot(v(l,l:100)); 

tiUeCREFERENCE  INPUT  v(t)'); 

xlabeK'TIME  (Sec.)’); 

ylabel('MAGNITUDE'); 

grid.pause 

axis 

clear  t 

t=l:l()0; 

subplot(212) 

plot(t,th(  1,1:1 00),t,th(2, 1 : 1 00),t,th(3, 1 : 100),t,th(4, 1 ;  1 00),... 
t,th(5, 1:100)); 

titleCESTIMATED  CONTROL  PARAMETERS’); 

XlabeK'TIME  (Sec.)'); 

ylabel('MAGNITUDE'); 

grid 

gtextChl'); 

gtextCl/bl’); 

gtext('h2'); 

gtextCkl'); 

gtext('k2'); 

pause 

meta  1 

clg 

subplot(21 1) 
plot(ym(  1,1 :!()())); 

titleCTHE  SIGNAL  TO  BE  FOLLOWED'); 

XlabeK'TIME  (Sec.)'); 
ylabelCMAGNITUDE'); 
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grid.pause 

subpiot(212) 

plot(y(l.l:l()0)): 

tiUe(THE  OUTPUT  OF  OUR  PLANT); 

xlabeKTIME  (Sec.)'); 

ylabel('MAGNITUDE’); 

grid.pause 

meta  2 

clg 

axis([()  1()0  -2  4]) 
plot(ym(l.l;100)); 
hold  on 

plot(y(l.l:l00),'*'); 

plot(y(l,l:100)); 

dUeCDESIRED  OUTPUT  ym(t)  {-)  vs  ACTUAL  OUTPUT  y(t)  { *♦}'); 

xlabeKTIME  (Sec.)'); 

ylabel('MAGNITUDE'); 

grid 

hold  off 
meta  3 

axis('normal'); 
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APPENDIX  B 


DMRAC  DESIGN  FOR  UNKNOWN  NONMINIMUM  PHASE 

LTI  SYSTEMS 

In  this  Appendix  we  address  the  problem  of  designing  an  adaptive  controller  for  a 
nonminimum  phase  LTI  system  based  on  the  algebraic  approach.  The  unknown  LTI  system 
to  be  controlled  is  described  by  the  following  polynomials; 


Mq)  =  q^*q*I, 
B(q)  =  q*2, 


(B-1) 


where  A(q)  is  stable  and  monic,  and  B(q)  is  monic  and  causes  the  system  to  be 
nonminimum  phase.  In  order  to  track  the  reference  model 

yjt)  -  OJSyJt-l)  =  (B-2) 

the  desired  closed  loop  polynomial,  P‘(q)  and  the  characteristic  polynomial  of  the 
observer,  a(q)  are  chosen  arbitrarily  as 


P*(q)  =  (q-0J8f, 
^(q)  =  q^  -  q*025. 


(B-3) 


Using  the  algorithm  in  Chapter  1  Section  C,  the  Diophantine  equation  can  be  formed 

as  follows:  (B-4) 

(hjq  +  hjXq^  *q  +  J)*  (k^q  +  kjXq  +  2)  (q^-q*  0.25)  {q^  *q*  I- (q^  -  I.6q  *  0.64)\, 
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or. 


(h,q ♦  *q*I)* (k,q ♦  kj)(q *2)  =  JSq^  -  0.96q^  -  0J9q * 021.  (B-5) 


Also,  P*(q)m(q)is  obtained  as 


(q^-q*  OJSXq^  -  0.8q  *  0.16}  ^  q*  -  1.8q^  ♦  1.21q^  -  0.36q  ♦  0.041 


(B-6) 


The  MATLAB  simulation  for  the  above  system  yields  the  following  gains, 

hj  =  1.7927 
=  -2.121 
kj  =  -0.6184 
=  1.1666 
h,  =  0.0000 
bj  =  -0.6642 
c,  =  0.6642 
Cj  =  02358. 


where  0,,  Oj,  Cy  and  Cy  are  the  coefficients  of  the  polynomials  b{q)  and  c(q)  respectively. 

These  results  can  be  checked  with  Sylvester  matrix  solution  for  this  system  which  can 
be  written  as 


10  0  0 

1.8 

1110 

fh 

-0.96 

112  1 

-0.39 

0  10  2 

0.21 

- 

.**2 

» 

(B-7) 
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By  solving  equation  (B-7)  we  obtain  the  following  values; 

h,  =  L8, 

*,  =  -2J5, 
k,  =  -0.«, 

=  1.18. 

Hence,  the  estimated  gains  needed  to  control  the  system  are  found  to  be  very  close 
to  the  gains  obtained  from  Sylvester  matrix  equations.  In  fact,  only  h,,  hj,  k,  and  kj  are 
needed  in  the  control  phase  while  b,,  bj,  e,  and  Cj  are  needed  only  in  the  estimation  phase 
of  the  entire  design  process. 

The  simulation  results  of  the  experiment  for  the  system  above  are  displayed  in  Figures 
(B-l)thru  (B-3). 

The  experiment  then,  is  conducted  for  an  unknown  nonminimum  phase  and  unstable 
LTI  system.  It  is  observed  that  the  algorithm  first  stabilizes  the  system  with  the  adaptive  pole 
placement  process  and  then  provides  the  necessary  input  to  track  the  desired  reference 
signal.  The  results  of  this  experiment  for  the  nonminimum  phase  and  unstable  system  arc 
displayed  in  Figures  (B-4)  thru  (B-6).  For  the  experiment  A(q)  is  chosen  to  be  as  follows: 

A(q)  =  *q* 2.  (B-8) 
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Figura  B-l  .  Reference  Input  and  Estimated  Control  Parameters  for  LTI 
Nonminimum  Phase  Unknown  Stable  System 
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REFERENCE  INPUT  v(t) 


aanxiNovw 


aanjLiNOViM 


Figur*  B-4 .  Reference  Input  and  Estimated  Control  Parameters  for  LTI  Nonminimum 
Phase  Unknown  Unstable  System. 
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THE  SIGNAL  TO  BE  FOLLOWED 
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Figur*  B-5.  The  Signal  To  Be  Followed  and  the  Actual  Output  For  LTl 
Nonminimum  Phase  Unknown  Unstable  System. 
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Figure  B-6 .  Desired  Output  vs  Actual  Output  For  LTl  Nonminimum  Phase  Unknown 
Unstable  System. 
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%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

% 

%  A  DMRAC  DESIGN  FOR  UNKNOWN  NONMINIMUM  PHASE  LTI  SYSTEMS 

7, 

%  Designing  a  controller  using  recursive  least  squares  estimates 
of  the  compensator  parameters. 

% 

%  Experimenting  with  a  stable  system 

7r 

%%%7c7c7c7c7c%%%%7c%%%%%%%%7c%%%%7c%7c%%7c%%%%%%%7€7c7c%7, 

7c 

!  del  *.met 
clear,clg,clc 

7(  Generating  the  reference  input  signal 

w=l  ()()()(); 
t={);w; 

v=square(t/(40*pi)); 

7c  Initialization  of  the  parameters 

kmax=length(v)+l; 
u=zeros(l,kraax); 
ubar=zeros(  1  ,kmax); 
uF=zeros(l,kmax); 
uFmin  1  =zeros(  1  ,kmax); 
y=zeros(l,kmax); 
yF=zeros(  1  ,kmax); 
yFmin  1  =zeros(  1  ,kmax); 
ym=zeros(l,kraax); 
th=zeros(8,kmax); 

P=w*eye(8); 

7c  Generating  the  reference  model 

for  j=2:lanax-l 
ym(.j)=().8*ymO-I)+v(j-l); 

end 
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%  Designing  the  controller 

for  k=6:kraax- 1 
hl=th(l.k); 
h2=th(2,k); 
kl=th(3,k); 
k2=th(4,k); 
bl=th(5,k); 
b2=th(6,k); 
cl=th(7,k); 
c2=th(8,k); 

u(k)=(h  1 + 1  )*u(k- 1  )+(h2-0.25)*u(k-2)+k  1  ♦y(k- 1  )+k2*y(k-2)+v(k)-v(k- 1  )+0.25*v(k-2); 
ubar(k)=u(k-2)-u(k-3)+0.25*u(k-4); 

uF(k)=u(k)- 1 .8*u(k- 1  )+l  .2 1  *u(k-2)-0.36*u(k-3)+0.04*u(k-4); 
uFmin  1  (k)=u(k- 1 )- 1 .8*u(k-2)+ 1.21  *u(k-3)-0.36*u(k-4)+0.04*u(k-5); 
y(k)=-y(k- 1  )-y(k-2)+u(k- 1  )+2*u(k-2); 
yF(k)=y(k)- 1 .8*y(k- 1 )+ 1 .2 1  *y(k-2)-0.36*y{k-3)+0.04*y(k-4); 
yFmin  1  (k)=y(k- 1 )- 1 .8*y(k-2)+ 1 .2 1  *y(k-3)-0.36*y(k-4)+0.04*y(k-5); 
phit(k,:)=[u(k-3)  u(k-4)  y(k-3)  y(k-4)  uF(k)  uFminl(k)  yF(k)  yFminl(k)]: 
K(;,k)=(P*phit(k.:)')/(l+phit(k.:)*P*phit(k.:)'); 
P=P-(P*phit(k.;)'*phit(k,:)*P)/(l+phit(k.:)*P*phit(k,:)’); 
th(:,k+l)=th(:,k)+K(:,k)*(ubar(k)*phit(k,:)*th(:,k)); 
end 

Plotting  the  results 
subplot(21 1) 
axis([0  w  *1.5  1.5]); 
plot(v(l,l;w)); 

tiUeCREFERENCE  INPUT  v(t)'); 

xlabeKTIME  (Sec.)’); 

ylabelCMAGNTTUDE'); 

grid,pause 

axis 

clear  t 

subplot(212) 

t=l:w; 

plot(t,th(  1 , 1  :w),t,th(2, 1  :w),t,th(3, 1  :w),t,th(4, 1  :w),... 

t,th(5, 1  :w),t,th(6, 1  .•w),t,th(7, 1  :w),t,th(8, 1  :w)); 
title(  ESTIMATED  CONTROL  PARAMETERS’); 

XlabeKTIME  (Sec.)’); 

ylabel(’MAGNITUDE’); 

grid; 

gtexU’hl'); 

gtext(’k2’); 
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gtext('cl'); 

gtext('c2'); 

gtext(’br); 

gtexiCkl'); 
gtexl(’b2'); 
gtext('h2'); 
pause 
meia  lb 


clg 

subpl«n(2n) 
axis([()  w  -10  10]); 
plot(yra(l,7:w)); 

titleCTHE  SIGNAL  TO  BE  FOLLOWED'); 

xlabeKTIME  (Sec.)'); 

ylabel('MAGNITUDE'); 

grid.pause 

subpiot(212) 

plot(y(l,l:w)); 

utleCTHE  OUTPUT  OF  OUR  PLANT'); 

xlabeK’TIME  (Sec.)'); 

ylabel('MAGNITUDE'); 

grid, pause 

meia  2b 


clg 

axis([0  w  -6  6]); 
pl<n(ym(l,l:w)); 
hold  on 

plot(().625*y(l,l;w).V); 

plot(().625*y(l,l:w)); 

titleCDESIRED  OUTPUT  yra(l){-}  vs  ACTUAL  OUTPUT  y(l){++}’); 

xlabeUTIME  (Sec.)'); 

ylabel(’MAGNlTUDE'); 

grid 

hold  oft" 
meta  3b 
axis 
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94 

9 

9,  A  DMRAC  DESIGN  FOR  UNKNOWN  NONMINIMUM  PHASE  LTl  SYSTEMS 

9, 

9<  Designing  a  controller  using  recursive  least  squares  estimates 
9i  of  the  compensator  parameters. 

9, 

94  Experimenting  with  a  unstable  system 

94 

94 

!  del  *.met 
clear,clg,clc 

94  Generating  the  reference  input  signal 

w=l()()0(); 

t=();w; 

v=square(t/(4()*pi)); 

94  Initialization  of  the  parameters 

kmax=lenglh(v)+l; 

u=zeros(l,kmax); 

ubar=zeros(  1  ,kmax); 

uF=zeros(l,kmax); 

uFmin  1  =zeros(  1  ,kmax): 

y=zeros(l  ,kmax); 

yF=zeros(  1  ,kraax): 

yFmin  1  =zeros(  1  ,kmax); 

ym=zeros(  l.kraax); 

ih=zeros(S,kmax); 

p=w*eye(8); 

94  Generating  the  reference  model 
for  j=2;kmax-I 

ym(j)={).8*ym(j- 1  )+v(j- 1 ); 
end 

94.  Designing  the  controller 
for  k=6;kmax- 1 

hl=th(l,k);h2=th(2,k);kl=th(3.k);k2=th(4,k);bl=th{5,k);b2=th(6,k); 
c  1  =th(7.k);c2=th(8,k); 

u(k)=(h  1 + 1  )*u(k- 1  )+(h2-0.25)*u(k-2)+k  1  *y(k- 1  )+k2*y(k-2)+v(k)-v(k- 1  )+0.25  *v(k-2); 
ubar(k)=u(k-2)-u(k-3  )+0. 25  *u(k-4); 

uF(k)=u(k)-1.8*u(k-l)+1.21*u(k-2)-0,36*u(k-3)+0.04*u(k-4); 
uFmin  1  (k)=u(k- 1 )- 1 , 8*u(k-2)+ 1.21  *u(k-3)-0  36*u(k-4)+0.04*u(k-5); 
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y(k)=-y(k- 1  )-2*y(k-2)+u(k- 1  )+2*u(k-2); 

yF(k)=y(k)- 1  8*y(k- 1 )+ 1 .2 1  *y(k-2)-0.36*y(k.3)-K).04*y(k-4); 

yFminl(k)=y(k-l)-1.8*y(k-2)+1.21*y(k-3)-0  36*y(k-4)+0.04*y(k-5); 

phit(k,;)=[u(k-3)  u(k-4)  y(k-3)  y(k-4)  uF(k)  uFminl(k)  yF(k)  yFniinl(k)]; 

K(;,k)=(P*phit(k,;)')/(l+phit(k,:)*P*phit(k,:y); 

P=P-(P*phit(k,:)’*phit(k,:)*P)/(l+phit(k,:)*P*phit(lc,;)'); 

th(:,k+l)=th(;,k)+K(:,k)*(ubar(k)-phit(k,:)*th(:,k)); 

end 

%  Plotting  the  results 
subplot(2 1 1 ) 
axis([0  w  -1  5  1.5]); 
plot(v(l,l:w)); 

titleCREFERENCE  INPUT  v(t)’); 

xlabeK’TIME  (Sec.)'); 

ylabel('MAGNITUDE'); 

grid,pause 

axis 

clear  t 

subplot(212) 

t=l:w, 

plot(t,th(  1 , 1  ;w),t,th(2, 1 ;  w),t,th(3, 1  :w),t,th(4, 1 ;  w),. . . 

t,th(5, 1  :w),t,th(6, 1  ;w),t,th(7, 1  :w),t,th(8, 1  :w)); 
titleCESTIMATED  CONTROL  PARAMETERS'); 
xlabeK'TIME  (Sec.)'); 
ylabel('MAGNITUDE'); 
grid; 

gtext('h  1  ');gtext('h2');gtext('c  1  ');gtext('c2'); 

gtext('bl');gtext('b2');gtext('k2');gtext('kr); 

pause 

meta  lb 

clg 

subplot(21 1) 
axis([0  w  -10  10]); 
plot(ym(l,7:w)); 

titleCTHE  SIGNAL  TO  BE  FOLLOWED'); 

xlabeK'TIME  (Sec.)'); 

ylabel(’MAGNITUDE'); 

grid,  pause 

subplot(212) 

plot(y(l,l:w)); 

titleCTHE  OUTPUT  OF  OUR  PL  ANT); 
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1 


xlabel(TIME  (Sec.)'); 

ylabelCMAGNITUDE'); 

grid,pause 

meta  2b 

clg 

axis([0  w  -6  6]); 
plot(ym(l,l:w)); 
hold  on 

plot(0.625*y(l.l:w),'+'); 

plot(0.625*y(l,l:w)); 

title(’DESIRED  OUTPUT  ym(t){-}  vs  ACTUAL  OUTPUT  y(t)'{-H-)); 

xlabeK'TIME  (Sec.)'), 

ylabel('MAGNITUDE’); 

grid 

hold  off 
meta  3b 
axis 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 
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APPENDIX  C 


BNN  SOFTWARE  SIMULATOR 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

%  SOFTWARE  SIMULATOR  FOR  BNN 

% 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

!  del  *.iTict 

clearxlg.clc 

rand('normal'); 

%  Random  input  to  train  the  system. 

num= 15000; 

ul=rand(l,num); 

k=0;num; 

9c  Actual  input  to  the  system. 

v=0.04*sin(k/(9*pi))-0.03*cos(k/(5*pi)); 

kmax=length(v); 

%  Calculating  the  reference  output. 

ym=2eros(  1  Junax- 1 ); 

y=zeros(l  Junax-l); 

utm  1  =zeros(  1  Jcmax- 1 ); 

mu=0.04; 

for  i=2-.kmax-l 

ym(i)=0.8*ym(i- 1  )+v(i- 1 ); 
end 

for  j=2:kmax-l 
%  model  1 

y(j+ 1  )=0.5*y(j)+0.4*y(j- 1  )+u  1  (jy'3+0.3*u  lQ)'^2-0. 1  *u  1 0); 

9c model2  y(j+ 1 )=(y(j)*y(j- 1  )*(y(j H2.5 )/( I  +y(jy'2+y0- 1  )''2))+u IQ); 

%model3  y(j+ 1  )=y(j)/(  ^  y(jy'2)+u  1  (j)'^3; 

%model4  y(j+l)=(y(j)*y(j-l)*yCj-2)*ul(j-l)*(yO-2)+l)+ul(j))/(l+ya'ir2+ya-2)^2); 
end 

9c  Initialization  of  the  neural  network. 
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[w3,w2,b2,w  1  .b  1  ]=initial2(  1 , 1 8, 1 8,5); 

%  Training  the  neural  network, 
for  t=4;kmax-l 

u=[ul(t-2)  ul(t-3)  y(t-2)  y(t-3)  (y(t)-0.8*y(t-l))]'; 
utm  1  (t- 1  )=net2(w3,  w2,b2,w  1  ,b  1  ,u); 

[  w3,w2,b2,  w  1  ,b  1  ]=bp2(w3,w2,b2,w  1  ,b  1  ,u,u  1  (t- 1  ),mu); 
end 

%  Plots  of  the  desired  signals. 
subplot(221) 
axis([0  200  -4  4]); 
plot(ul  (1:200)); 
titleCRANDOM  INPUT); 
grid 

subplot(222) 
axis([0  200  -4  4]); 
plot(utml(l:2(X))); 

titleCOUTPUT  OF  THE  NEURAL  NETWORK  ); 
grid 

subpIot(223) 
axis([0  1000-0.1  0.1]); 
plot(v(  1:1000)); 
tiUe('INPUT’); 
grid 

subplot(224) 

axis([0  1000-0.4  0.4]); 

plot(ym(l;1000)): 

tiUeCDESIRED  RESPONSE  ); 

grid,pause 

meta  h 

%  Controlling  the  plant  with  a  neural  identifier-controller, 
clg 

clear  u;  clear  y;clear  t; 
u=zeros( 1,1 0000); 
y=zeros(  1,1 0000); 
il=zeros(5, 10000); 
i2=zeros(5, 10000); 
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for  t*4:9999; 

il(:,t)=(u(t-l)  u(t*2)  y(t-l)  y(t-2)  v(t)l'; 
u(t)3:net2(w3,w2,b2.w  1  ,bl  ,i  1 
y(t)=(y(t- 1  )*y(t-2)/(  l+y(t- 1  )^2+y(t-2)^2))+u(t- 1 ); 
i2(:,t)=(u(t.2)  u(t-3)  y(t-2)  y(t-3)  (y(t)-0.8*y(M))l*; 

[  w3,w2,b2,  w  1  ,b  1  ]=l^2(w3,w2,b2,w  1  ,b  1  .i2(:,t),u(t- 1  ),inu); 
end 

%  Ploting  the  results. 

subplot(  111) 
n= 1:999; 

axis([0  1000  -0.6  0.6]); 

plot(n,ym(  1 , 1 :999),n,y(  1 , 1 :999),'.'); 

titleCDESIRED  OUTPUT  {-]  vs  ACTUAL  OUTPUT  {..}*); 

grid,pause 

meta  hh 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

%  BNN  DMRAC  WITH  UNKNOWN  NONLINEAR  NONMINIMUM 
9c  PHASE  SYSTEM 

9c 

%9c%9c9c%%9c9c%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

9c%% 

!  del  *.met 

clear,clg,clc 

rand('nonnal'); 

9c  Random  input  to  train  the  system. 

num=1000; 

ul=rand(  1,500000); 

k=0:num; 

9c  Actual  input  to  the  system. 
v=0.03*square(k/(  10*pi)); 

7cv=0.04*sin(ky(9*pi))-0.03*cos(k/(5*pi)); 

kmax=iength(v); 

%  Calculating  the  reference  output. 
ym=zeros(  1  Jkmax- 1 ); 
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y=zcros(l  Jcmax-1); 
utm  1  =zeros(  1  Jcmax- 1 ); 
mu=0.099; 
for  i=2:kmax-l 

ym(i)=0.8*ym(i- 1  )+v(i- 1 ); 
end 

for  j=2:kmax-l 
%model2 

y(j+ 1  )=(y(j)*yO- 1  )*(y0)-*-2.5)/(  i  +y0y'2-»-y0- 1  )^2))+u  1 0H2*u  1 0- 1 ); 

%model3  y(j+l)=y(j)/(l+y0)^2)+ul(jr3; 

%modcl4  y(j+ 1  )=(y(j)*y(j- 1  )*y(j-2)*u  1  (j- 1  )*(y(i-2)+l  )+u  1 0))/(  1  +y0'  1  )^2+y0-2)^2); 

^  model  1  y0+ 1  )=0.5*y(j)+0.4*y(j- 1  )+u  I  (j)''3+0.3*u  1  ay'2-0. 1  *u  1 0); 

end 

%  Initialization  of  the  neural  network. 

[  w3,w2,b2,w  1  ,b  1  ]=initial2(  1 ,20,20,5); 

%  Training  the  neural  network, 
for  t=4;kmax-l 

u=(ul(t-2)  ul(t-3)  y(t-2)  y(t-3)  (y(t)-0.8*y(M))]’; 
utm  1  (t- 1  )=net2(w3,w2,b2,w  1  ,bl  ,u); 

( w3,w2,b2,w  1  ,b  1  ]=bp2(w3,w2,b2,w  1  ,bl  ,u,u  l(t- 1  ),mu); 
end 

%  Plots  of  the  desired  signals. 

subplot(221) 
axis([0  200  -4  4]); 
plot(ul(l:200)); 
titleCRANDOM  INPUT); 
grid 

subplot(222) 
axis([0  200  -4  4]); 
plot(utml  (1:200)); 

titleCOUTPUT  OF  THE  NEURAL  NETWORK'); 
grid 

subplot(223) 
axis([0  1000*0.1  0.1]); 
plot(v(  1:1000)); 


71 


tiUc('INPUT); 

grid 

subplot(224) 

axis([0  1000-0.4  0.4]); 

plot(ym(  1:1000)); 

titlcCDESIRED  RESPONSE  ); 

grid,pause 

meta  h 

%  Controlling  the  plant  with  a  neural  identifier-controller, 
clg 

clear  u 
clear  y 
clear  t 

u=zeros(l,num); 

y=zeros(l,num); 

il=zeros(5,num); 

i2=zeros(5,num); 

for  t=4:999; 

il(:,t)=[u(t-l)  u(t-2)  y(t-l)  y(t-2)  v(t)J’; 
U(t)=net2(w3,w2,b2,wl,bl,il(:,t)); 
y  (t)=(y(t- 1  )*y  (t-2)/(  1  +y(t- 1  )^2-t-y(t-2)^2))-m(t- 1 ); 
i2(:,t)=[u(t-2)  u(t-3)  y(t-2)  y(t.3)  (y(t)-0.8*y(t-l))]'; 

[  w3,w2,b2,  w  1  ,b  1  ]=bp2(w3,w2,b2,w  1  ,b  1  ,i2(:,t),u(t- 1  ),mu); 
end 

9c  Rioting  the  results. 

subplot(l  1 1) 
n=  1:999; 

axis([0  1000-0.6  0.6]); 

plot(n,ym(  1 , 1 :999),n,y(  1 , 1 :999),’.'); 

titleCDESIRED  OUTPUT  {-}  vs  ACTUAL  OUTPUT  {..}'); 

grid,pause 

meta  hh 


9c9c9c9c9c9c9c9c9c9c9c9c9c%9c9c%%9c%%%%%9c%%%%%%%%%%%%%9c%9c%%9c 
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%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

function  [W3,W2,b2,W  1  ,bl ]ainitial2(ny,M2,M  1  ,nu) 

% 

%  initializes  the  neural  network 

% 

%  ny=number  of  outputs,  nu^number  of  inputs, 

%  M2,M  1  =number  of  neurons  in  hidden  layers 

% 

rand('normal') 

W3=rand(ny,M2); 

W2=rand(M2,Ml);  b2=rand(M2,l); 

Wl=ones(Ml,nu);  bl=rand(Ml,l); 

end 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

function  y=net2(W3,W2,b2,Wl,bl,u) 

7c 

7c  sets  up  the  data  structure  for  2  layer  neural  network 
7c  u=input,  y=output 
7c  W=weights,  b=biases 
7c 

v=sigmoid(W  1  *u+b  1 ); 

z=sigmoid(W2*v+b2); 

y=W3*z; 

end 

7c7c7c7c7c7c7c7c7c7c%7c7c%7c7c%%%%%%%7c7c%%%%%7c%7c%7c7c%%%7c%%7c% 

7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7o7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c7c 

7c 

function  [W3,W2,b2,W  1  ,b  1  ]=bp2(W3,W2,b2,W  1  ,b  1  ,u,yd,mu) 

7c 

7c  updates  the  weights  and  the  biases  of  a  2  layer  neural  network  with  backpropagation 

7c 

7c  u  is  the  input  to  the  network, 

7c  yd  is  the  desired  reference  model  to  track 
7c  mu  is  the  learning  rate 

vb=Wl*u+bl;  v=sigmoid(vb); 
zb=W2*v+b2;  z=sigmoid(zb); 
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y=W3*z; 

e=y-yd; 


d3=e; 

d2=diag(W3'*d3)*dsig(zb); 
d  1  =diag(  W2'*d2)*dsig(vb); 

W3=W3-mu*d3*z'; 

W2=W2-mu*d2*v';  b2=b2-mu*d2; 

Wl=Wl-mu*dl*u';  bl=bl-mu*dl; 


end 

%^<^%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 

% 

function  y=sigmoid(x) 

% 

%  introduces  the  nonlinearity  to  neural  network 

% 

x=min(x,100);  x=max(x,-100); 
y=  (l-exp(-x))./(l+exp(-x)); 
end 

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 


9c 

function  d=dsig(x) 

9c 

9c  takes  the  derivative  of  sigmoid 
9c 

9c  X  is  a  vector 
9c 

x=min(x,100);  x=max(x,-100); 
temp=exp(-x); 

d=temp  ./  (l+2*temp  +  temp  .*  temp); 
end 

9c9c9c9c9c9c9c9c9c9c%9c%%9c9c9c9c9c9o9c9c9c9c9c%9c9o9o9c9c9o9c9c%9c9c9o9c9c9c9c9c9c 
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